Modeling Text through Gaussian Processes
نویسندگان
چکیده
This paper proposes a continous space text model based on Gaussian processes. Introducing latent coordinates of words over which the Gaussian process is defined, we can encode word correlations directly and lead to a model that performs better than mixture models. Our model would serve as a foundation of more complex text models and also as a statistical visualization of texts.
منابع مشابه
Towards Indefinite Gaussian Processes
Gaussian processes (GPs) enable probabilistic kernel-machines with remarkable modeling efficacy and GPML toolbox facilitates a widespread use by practitioners and researchers. Many modern applications demand non-metric (dis)similarities. As a result, Mercer’s condition for positive semidefiniteness is violated. Through a simple text categorization example that involves a KL-divergence based ker...
متن کاملGaussian processes in Bayesian modeling : Manual for Matlab toolbox
(This is an early version of the manual, which is still subject to some modifications. The text contains still errors in some details but great picture is correctly described.)
متن کاملTwitter-Network Topic Model: A Full Bayesian Treatment for Social Network and Text Modeling
Twitter data is extremely noisy – each tweet is short, unstructured and with informal language, a challenge for current topic modeling. On the other hand, tweets are accompanied by extra information such as authorship, hashtags and the user-follower network. Exploiting this additional information, we propose the Twitter-Network (TN) topic model to jointly model the text and the social network i...
متن کاملModeling Tweet Arrival Times using Log-Gaussian Cox Processes
Research on modeling time series text corpora has typically focused on predicting what text will come next, but less well studied is predicting when the next text event will occur. In this paper we address the latter case, framed as modeling continuous inter-arrival times under a logGaussian Cox process, a form of inhomogeneous Poisson process which captures the varying rate at which the tweets...
متن کاملProperties of Spatial Cox Process Models
Probabilistic properties of Cox processes of relevance for statistical modeling and inference are studied. Particularly, we study the most important classes of Cox processes, including log Gaussian Cox processes, shot noise Cox processes, and permanent Cox processes. We consider moment properties and point process operations such as thinning, displacements, and superpositioning. We also discuss...
متن کامل